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Abstract 

We study the time change of the relation between the 
rank of a racehorse in the Japan Racing Association 
and the result of victory or defeat. Horses are ranked 
according to the win bet fractions. As the vote pro- 
gresses, the racehorses are mixed on the win bet frac- 
tion axis. We see the emergence of a scale invariant 
relation between the cumulative distribution function 
of the winning horse x\ and that of the losing horse xq. 
x\ oc Xq holds in the small win bet fraction region. We 
also see the efficiency of the market as the vote pro- 
ceeds. However, the convergence to the efficient state 
is not monotonic. The time change of the distribution 
of a vote is complicated. Votes resume concentration 
on popular horses, after the distribution spreads to a 
certain extent. In order to explain scale invariance, we 
introduce a simple voting model. In a 'double' scaling 
limit, we show that the exact scale invariance relation 
x\ = Xq holds over the entire range < xq, x\ < 1. 



1 Introduction 

Racetrack betting is a simple exercise of gaining a profit 
or losing one's wager. However, one needs to make a 
decision in the face of uncertainty and a closer inspec- 
tion reveals great complexity and scope. The field has 
attracted many academics from a wide variety of dis- 
ciplines and has become a subject of wider importance 
PQ . Compared to the stock or currency exchange mar- 
kets, racetrack betting is a short-lived and repeated 
market. It is possible to obtain starker views of aggre- 
gated better behaviour and study the market efficiency. 
One of the main findings of the previous studies is the 
'favorite-longshot bias' in the racetrack betting market 
[2j[3]. Final odds are, on average, accurate measures 
of winning and short-odds horses are systematically 



undervalued and long-odds horses are systematically 
overvalued. 



From an econophysical viewpoint, racetrack betting 
is an interesting subject. Park and Dommany have 
done an analysis of the distribution of final odds (div- 
idends) of the races organized by the Korean Racing 
Association [4]. They found power law behaviour in 
the distribution and proposed a simple betting model. 
Ichinomiya also found the power law in the races of 
the Japan Racing Association (JRA) [5] and proposed 
another betting model. We have studied the relation 
between the rank of a racehorse in JRA and the result 
of victory or defeat [6]. Horses are ranked according 
to the win bet fractions. We studied the distribution 
of the winning horses in the long-odds region [B]. Be- 
tween the cumulative distribution function of the win- 
ning horses X\ and that of the losing horses Xq, we find 
a scale invariant relation x\ oc Xq with a = 1.81. We 
show that in a 'Polya' like betting model, where bet- 
ters vote on the horses according to the probabilities 
that are proportional to the votes, such a scale invari- 
ance emerges in a self-organized fashion. Furthermore, 
the exact scale invariant relation x\ — x$ holds ex- 
actly over the entire range < Xo,Xi < 1 in a limit. 
We also studied a voting model with two kinds of vot- 
ers, independent and copycat [7]. We find that a phase 
transition occurs in the process of information cascade 
and it causes a critical slowing down in the convergence 
of the decision making of crowds. 



In this paper, we study the time series data of the 
votein JRA races in detail. We show that as the vote 
progresses, scale invariance and market efficiency do 
emerge in the rank of the racehorses. The decision 
making process in betting is not simple and some subtle 
mechanism does work. We explain the scale invariance 
based on a simple betting model. 



2 Racetrack betting process 

We study the win bet data of JRA races in 2008. A 
win bet requires one to name the winner of the race. 
There are 3542 races and we choose 3250 races whose 
final public win pool (total number of votes) V r is in 
the range 10 5 < V r < 10 6 ,r G {1, 2, • • • , R = 3250}. 
H r horses run in race r and 7 < H r < 18. We remove 
133 cancelled horses and the total number of horses 
N = J2 7 Li H r is 47273. There are 3251 winning horses 
(one tie occurs) and are denoted as Ni — 3251. Of 
course, the remaining 44022 horses are losers and are 
denoted as No = N — N% . The number of times race 
r is announced is denoted as K r and is in the range 
13 < K r < 262. The total number of announcements 
is K = Yl r =i — 285269. The time to the race entry 
time (start of the race) in minutes is denoted as T£. We 
denote the odds of the ith horse in race r at the fcth 
announcement as O r ik and the public win pool as V k . 
I\ denotes the results of the races. = 1(0) implies 
that the horse wins (loses). A typical sample from the 
data is shown in Table [TJ 



We define x\{t) and T r {t) as 

= xIm{v) ( 4 ) 
T r (t) = Tl r{v) . (5) 

We denote the average value of T r (t) as T(i) and it is 
defined as 

r (*)^E?W ^ 

r— 1 
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Table 1: Time Series of odds and pool for a race that 
starts at 13 : 00. H r = 10, K r = 52 and we show the 
data only for the first three horses 1 < i < 3. The first 
horse wins in the race (/[ = 1,1% = 1$ = 0) . 
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From O r ik , we estimate the win bet fraction x r ik by 
the following relation. 

0.788 
' Oik' - 1 ' 

If the above does not sum up to one, we renormalize it 
as x r ik — „H^.' fc - ■ Hereafter, we use x r ik in place of 

We use the average public win pool as a time variable 
t of the whole betting process. For each v, we choose 
the nearest V[ k and use the average value of Vf k as the 
time variable t. More explicitly, we define t as 

R 

r— 1 

k r (v) = {k\Min k \V{-v\}. (3) 



Figure 1: Relation between average time to start T and 
average number of votes t. 

Figure [1] shows the relation between T and t. We see 
a rapid growth of the average number of votes t as we 
approach the start of the race (T — > 0). Almost half 
of the votes are thrown in the last ten minutes. We 
choose five timings from the voting process and denote 
them as tj,j £ {0, 1, 2, 3, 4}. to corresponds to the first 
voting data k — 1 in each race, to ~ 70.4 and T(to) — 
620.6[min]. At t 1 ~ 2342.9, T{t x ) ~ 440.2 [min]. At 
t 2 ~ 15482.0, T(t 2 ) ~ 167.6[min] and at t 3 ~ 195201.3, 
T(ts) ~ 2.5[min]. t± corresponds to the last voting 
data k — K r in each race, ~ 249708.8 and T(^) ~ 
— 1.1 [min]. All data are summarized after the start of 
race. The reason why we choose U,i £ {1,2,3} will 
become clear in section 4. 

In order to see the betting process pictorially, we 
arrange the N horses in the order of the size of x\ (t) . 
We denote the arranged win bet fraction as x a (t),a G 
{1,2,- -.,#}. 

asi(t)>x 2 (*)>ass(*)>--->x^(*) (7) 

I a (t) tells us whether horse a wins (I a — 1) or loses 
(I a =0). In general, the probability that the horse 
with large x a (t) wins is big and vice versa. We ar- 
range the horses in the increasing in order of a from left 
to right. On the left-hand side of the sequence, more 
strong horses exist. On the right-hand side, there are 
more weak horses. If the win bet fraction does not con- 
tain any information about the strength of the horses, 
I a (t) is one and zero randomly. Conversely, if the in- 
formation is perfectly correct, the first N x horses' I a (t) 
are 1 and the remaining No horses' I a (t) are 0. 




Figure 2: Pictorial presentation of the betting process. 
We choose 100 winning (red) and 100 losing (blue) 
horses randomly and follow their ranking as the betting 
proceeds. One losing horse is tagged by yellow dots. 

Figure [5] shows the time change of the ranking pro- 
cess of the horses. We choose 100 horses for each cate- 
gory I a = {1,0} and follow their ranking as the betting 
proceeds. We use Av = 2000 as the time step. Dur- 
ing one hundred steps, the average number of votes t 
changes from to for v = (bottom) to about 178116 
for v = 200000 (top), v = corresponds to the first 
announcement r — 1 and t — to- At to, the horses 
are arranged in the sequence at random. The win bet 
fractions x a (to) do not contain much information about 
the strength of the horses. As the betting progresses, 
the phase separation between the two categories of the 
horses does occur. Winning (losing) horses move to the 
left (right) in general. In the end, to the left (right) are 
more winning (losing) horses. In the betting process, 
betters have succeeded in choosing the winning horses 
to some extent. We also note that there remain win- 
ning horses in the right. This means that we can find 
winning horses with very small win bet fractions. 
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Figure 3: (a) Double-logarithmic plot of the ROC 
curve (xo, x-y). We plot four ROC curves for t = U, i € 
{0, 1, 2, 3, 4}. The fitted lines with x\ — a-x^ for to and 
^3 are also plotted, (b) Double-logarithmic plot of the 
ROC curve (x , x x ) for the data of all 1986-2006 JRA 
races. We also show the fitted line with x\ — a ■ Xq. 



3 Emergence of scale invariance 

In order to see the existence of a winning horse with 
a very small win bet ration or very low rank, we cal- 
culate the cumulative functions of the winning horses 
and that of the losing horses counted from the lowest 
rank (right). More precisely, we study the following 
quantities. 

1 - 

I* a=JV(l-s) 

The above are normalized so that x^(s) is zero at s = 
and 1 at s = 1. The curve (xq(s), Xi(s)) is known as a 
receiver operating characteristic (ROC) curve. We are 
interested in the limit s — > 0. In particular, we study 
the scale invariant relation x\ cx Xq . If such a power 
law relation holds, we can find winning horses with any 
small win bet fraction. 

Figure [3^ shows the double logarithmic plot of the 
ROC curves (xq(s),Xi(s)). We show five curves for 
t = U,ie {0,1,2,3,4}. At t = to, there are 70 votes 



in the pool on average. The plot is almost a diagonal 
line from (0, 0) to (1, 1), which means that the winning 
and losing horses are mixed well in the sequence. The 
curve is fitted with the power relation x\ = a ■ Xq with 
a = 1.03. As the betting progresses, the curve becomes 
more downward convex shape and the slope increases. 
At t — t 3 ,t4, the degree of the phase separation be- 
tween the two types of horses reaches its maximum. 
On the other hand, we also see that there exist win- 
ning horses in the very low rank region. Each step 
upwards of the curve implies the existence of a win- 
ning horse and the upward movement starts for a very 
small Xq- In addition, we also see a straight line region 
in the curve in 0.03 < xq < 0.3. We have fitted the 
curve as in the previous (to) case and we get a = 1.77. 
This means that the scale invariant relation between 
x\ and xo also holds after many rounds of betting. 

In order to see scale invariance more clearly, we show 
the same plot for the data of all JRA races from 1986 
to 2006 in Figure [3j3 ; 6 . There are 71549 races and 
Ni = 71650 and N = 829716. The plot becomes 



straight in 0.003 < x < 0.3 with a = 1.81. 

4 Emergence of efficiency 

The win bet fraction aggregates the wisdom of the bet- 
ters in the racetrack betting market. In order to quan- 
tify the accuracy, we use two measures. The first one 
is the accuracy ratio ( AR) . AR measures how an event 
occurs in the order of a rank. Here, we consider the 
event that a horse wins in the race. Horses are arranged 
in the increasing order of the size of the win bet frac- 
tion £ {1, 2, • • ■ N}. There are A^i winning horses 
and Nq = N — Ni losing horses. As we have explained 
before, if the prediction of the betters is good, the win- 
ning horses are concentrated in the higher ranks. If the 
prediction is perfect, the first N± horses are the win- 
ning ones and I a is one for them. In order to define the 
accuracy of the prediction or to measure the complete- 
ness of the rank, we introduce a Lorenz curve (x, L(x)) 
for < x, L(x) < 1 as 



L(a;) 



N-x 



(9) 
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Figure 4: Plot of (x,L(x)) (a) and (x,EL(x)) (b) for 
t = t h i€ {0,1,2,3,4}. 

Figure 2£i depicts the Lorenz curves for t = tj,j £ 



0,1,2,3,4. At t = t , after about seventy rounds, 
the horses are arranged at random on the sequence 
a £ {1, 2, • • • , N}. The Lorenz curve runs almost along 
a diagonal line. As the betting proceeds, the winning 
horses (I a = 1) move to the higher ranks and the de- 
gree of the upward convex nature of the curve increases. 
The preciseness of the prediction increases monotoni- 
cally. At t = t%, the increase almost stops and the 
accuracy of the predictions reaches a maximum. 

In order to quantify the accuracy of the predictions 
of the racetrack betters, we use accuracy ratio, AR [L2] , 
AR is defined as 



AR: 



( / L(x)dx 
Jo 



-)/-d 

2" 2 V 



Ni. 
N ■ 



(10) 



AR measures how different is the ranking from the 
complete case. The denominator in the definition is 
the normalization factor that ensures that AR is one 
for the completely ordered case. If the ranking is per- 
fect, Jp 1 L(x)dx -3 = 3(1-7^) and AR is 1. 

We also introduce an expected Lorenz curve 
(x,EL(a;)), which is defined as 



N-x 

EL(x)^-]T. 



(11) 



9J . In order to quantify how the bets are concentrated 
or scattered among horse, we introduce expected AR 
and call it EAR. EAR is defined as 



EAR: 



( / EL(x)dx 
Jo 
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R 

N 



(12) 



If we assume that the horses are divided into two 
groups, A and B. The horses' win bet fraction in group 
A (x a , a £ A) is large and those of other horses in 
group B (xp,/3 £ B) is small, the votes are concen- 
trated on the horses in group A. EAR is large. If the 
votes are scattered among all horses, EAR is small. In 
particular, if x a = jj for all a £ N, EAR is zero. If 
x a — 1, a £ A and xp = 0, (3 £ B, EAR is one. 

Figure gb depicts EL(a:) for t = tj,j £ 0,1,2,3,4. 
At t — to, EL(x) rapidly increases to one, which im- 
plies that votes are concentrated on small number of 
horses. However, AR is small at to and the horses are 
not the winning horses. After £o> the votes are scat- 
tered among many horses and the degree of the upward 
convex nature of the curve decreases up to ti- After 
the decline, it begins to increases. At £4, the degree of 
the concentration of votes reaches a maximum, ti is 
the boundary line. Before ti, votes are more and more 
distributed among many horses. After £2, the votes 
tend to be concentrated on popular horses. 

By comparing the behaviour of L(x) and EL(ir), we 
are able to study the efficiency of the market. If the 
probability that the horse a wins is x a , the two Lorenz 
curves L(x) and EL(a;) do coincide with each other in 
the limit N — > 00. Hence the equality AR = EAR 
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Figure 5: Plot of AR and EAR as the functions of £. 
The plot starts at £o and the two curves intersects at 
t\ and £3. At £2, the discrepancy reaches its maximum 
after t\. 



is a necessary condition of the efficiency of the mar- 
ket. However, it is not a sufficient condition. Even if 
the equality holds, there is a possibility that the two 
curves depart from each other. We also note that if the 
strength of the horse at rank x is overvalued, the in- 
equality L'(cc) < EL' (2) holds. On the contrary if the 
strength is undervalued, the inequality L'(x) > EL'(x) 

holds. Here L'(x) = ^ and EL'(a:) = ^j^ 1 . 

Figure [3] shows AR and EAR as the functions of £. 
As the betting progresses, AR increases monotonically 
and it almost reaches its maximum at £ = £2. After- 
wards, the increase in AR is very slow and the following 
bets do not increase the accuracy of the prediction as to 
which horse wins the race. More interesting behaviour 
can be found in the time change of EAR. At £ = to, 
EAR is very large and is nearly 0.9. Almost all votes 
are concentrated on small number of horses. However, 
AR at t = to is small and the horses with large bet 
fractions are not so strong. The true strong horses are 
scattered all over the rank of the win bet fraction. Af- 
terwords, EAR decreases rapidly and at t = ti, AR and 
EAR coincide. The necessary condition of the market 
efficiency is satisfied at t = t\. Up to t — £2, EAR 
decreases and almost reaches its minimum. The bets 
are scattered among many horses and this implies the 
rich variety of the betters' predictions as to which horse 
wins the race. This also means that the strong horses 
are undervalued and the weak horses are overvalued, 
that is the 'favorite-longshot bias' state, which can be 
seen more clearly below. The discrepancy between AR 
and EAR is the largest at £ = £2 after t = t±. After 
that, the discrepancy decreases monotonically as EAR 
increases faster than AR. The bets begin to be con- 
centrated on more popular horses. At t = £3, AR and 
EAR coincide again. The necessary condition of the 
market efficiency is satisfied again. After £3 , the degree 
of the concentration increases further, the discrepancy 
between AR and EAR is small even at £4. 

Figure [5^- shows the discrepancies between L(x) and 
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Figure 6: (a) Plot of DL(x) = L(x) - EL(x) for £ = 
U, i G {0, 1, 2, 3, 4}. (b) Plot of DL(x) for £ 3 and all the 
data of JRA races (1986 — 2006). Inset figure show the 
plot for all the data case for < x < 1%. 



EL(x) at £ = ti,i <E {0,1,2,3,4}. On the y-axis, we 
show DL(a;) = L(x) — EL(x). As we have explained 

before, the sign of DL'(ie) = d ^'}^ tells us whether 
the horse at rank x is overestimated or underestimated. 
If DL'(a;) is zero, the strength of the horse is prop- 
erly estimated by the betters and the racetrack bet- 
ting market is efficient. On the other hand, if DL'(x) 
is positive (negative), the strength is underestimated 
(oversetimated) . 

At t = t%, DL(a;) is close to the x-axis and we see 
that for small x < 20%, DL(x) < 0. This means that 
the market is almost efficient, but top 10% popular- 
horses are overestimated and next 10% horses are un- 
derestimated. Bets are more accumulated on the top 
10% horses and their win bet fractions are larger than 
their true winning probabilities. On the other hand, 
there are less bets on next 10% horses as compared 
to their true winning probabilities. Remaining 80% 
horses' strength are properly estimated, because DL(x) 
is close to the x-axis. At £ = £2, DL(x) is positive for 
all < x < 1. From the figure, we see that the pop- 
ular 30% horses are underestimated as compared to 



their winning probabilities. Remaining 70% unpopu- 
lar horses are overestimated. The bets are distributed 
among many horses, including many weak horses and 
an inefficient state is realized. Following this, the graph 
of DL(ir) approaches the x-axis at t — £3. The coinci- 
dence between the two Lorenz curves is better than at 
t = t\. For small x (popular horses), DL(x) < and 
the strong horses are overestimated. For large x (un- 
popular horses), DL(x) is almost on the x-axis. From 
the small discrepancy, we see that the top 20% horses 
are overestimated. And next 30% horses are underesti- 
mated. However, the slope of DL(x) in the two regions 
is small and the market is almost efficient. This effi- 
cient state remains to be true even at £3. 

Figure shows DL(x) for the data of all JRA races 
(1986-2006). Contrary to the 'favourite- longshot bias', 
we see some complex behaviour. Top 0.4% horses are 
underestimated and next f 0% horses are overestimated 
and so on. However, the degree of the inefficiency is 
very small. 

5 Voting model and scale invari- 
ance 

We consider a voting model for N horses [5J. Bet- 
ters vote for them one by one, and the result of each 
voting is announced promptly. The time variable 
t £ {0, 1, 2 • • • , T} counts the number of the votes. The 
horses are classified into two categories \x £ {0, 1}, and 
we call them binary horses. There are horses in 
each category and iV + iVi — N. 

We denote the number of votes of the ith horse fi £ 
{0, 1} at time t as {Xit}ie{i,—,Nu}- At £ = 0, Xf t takes 
the initial value X^ Q = > 0. If the ith candidate /i 
gets a vote at t, X^ t increases by one unit. 
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After infinite counts of voting, i.e. T 



00, the 



share of votes , 



hmi 



becomes the beta 



distributed random variable beta(s /J , Z — s M ) on [0, 1]. 
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(16) 



Next, we focus on the thermodynamic limit 
No, N± — ► 00 and Zq = NqSq + N\Si — > 00. The expec- 

We introduce 



tation value of is < x% >= — 



a variable v% 



(Z 



l)xf. The distribution 



function p s (u) in the thermodynamic limit is given as 



lim p(a;f 

Zq — >oo 



Zq 



V 



r(v) 



(17) 

The share of votes, u, of a candidate /1 follows a gamma 
distribution function with s^. 

After many counts of voting, T — > 00, the two types 
of horses are distributed in the space of u according 
to the gamma distribution in the thermodynamic limit 
Zq — > 00. If si > so, a candidate belonging to category 
p, = 1 has a higher probability of getting many votes 
than a candidate belonging to category fj, = 0. Even 
the latter can obtain many votes. It is also possible 
that the former may get few votes. Thus, there is a 
mixing of the binary candidates. 

The cumulative functions x fJl (w) is given as 



(18) 



x^{w) = I p s (u)du. 



Using the incomplete gamma function of the first kind 
7(s, w) = j Q e~ u ■ u s ~ 1 du, it is given as 



r(v) 



l{s», w )- 



(19) 



A better casts a vote for the N candidates at a rate 
proportional to X? t . The probability P? t that the ith 
candidate /z gets a vote at t is 



Z t ' 

1 W M 



(13) 



Z t = Y,^X? t = N lSl +N so+t. (14) 

f_t=Q i=l 

The problem of determining the probability of the 
ith candidate /i getting n votes up to T is equivalent 
to the famous Polya's urn problem [10l [11] . The prob- 
ability that the ith candidate \x gets n votes up to T is 
given by the beta binomial distribution 



Near the end point, w — » 0, in other words, in the 
small u region, the incomplete gamma function 7(s M , t) 
behaves as 

7(«/.,«0 (20) 



As x Sii (w) oc w Sft , the following relation holds: 

81 



X\ ~ Xq 



with 



so 



(21) 



We see that a scale invariant behaviour appears in the 
mixing. 

Furthermore, in the limit (si, sq) — ► (0, 0) with fixed 
a = si/sq, next relation holds [BJ. 



X\ — Xq 



(22) 



Prob(Xf r -s M = n) = T C n -— (ry - s . (15) 



(Zoh 



(a) n is defined as (a) n — F p° ^"^ . 



The scale-invariant relation holds over the entire range 
< xq , x\ < 1. This feature is remarkable from the 
viewpoint of statistical physics. Usually, the power- 
law relation holds only in the tail. 




Figure 7: Exact scale invariant gradation pattern with 
varying a from a = 1 (bottom) to a — 100 (top). 

We discuss the limit in the derivation of the ex- 
act scale invariance. In the derivation of the gamma 
distribution, we take the thermodynamic limit Zq = 
NiSi + NqSq — > oo. With the gamma distribution, 
Xi = x% holds in the limit {s A J -> 0. For ^ to 
hold, these two limits, Zq — > oo and {s M } — > 0, should 
go together. {s M } approaches zero more slowly than 
{N^} approaches infinity. We call the limit Zq — > oo 
and {s M } — > with fixed a — s\j sq as the double scal- 
ing limit. If we take the limit {s M } — + without the 
limit Zq — ► oo, the firstly chosen candidate gets all the 
remaining votes and there is no mixing of the binary 
candidates. The double scaling limit is crucial to the 
emergence of the exact scale invariance. 

6 Exact scale invariant grada- 
tion pattern 

The voting problem reduces to a random ball removing 
problem with the relative probability s M in the double 
scaling limit [B]. Figure 11 shows the gradation pattern 
made by the algorithm of the random ball removing 
problem. We prepare N% red balls and Nq blue balls. 
From them, we take one ball at a time and do not re- 
turn it. The probability that a red (blue) ball is chosen 
is proportional to si(so). We repeat the procedure up 
to when there remains no ball. We get a sequence of 
N = Ni + N balls. In the limit iVi,iV"o -> oo, the 
exact scale invariance between x\ and xq holds. In the 
figure, we change the ratio a = |^ from 1 (bottom) 
to 100 (top). Near the bottom, two types of balls are 
mixed. Near the top, phase separation occurs. 

7 Summary 

In this paper, we study the time series of win bet frac- 
tion of the 2008 JRA races. We see that the betting 
process induces scale invariance between the cumula- 
tive functions of the winning horses and that of the 
losing horses. We also see the emergence of the mar- 
ket's efficiency after many rounds of betting. However, 
the convergence to the efficient state is not monotonic. 



The dynamics of the distribution of the votes among 
the horses is complex. At first, the votes accumulate to 
small number of horses and then they are distributed 
among many horses, including weak horses. At this 
time, the strong horses are underestimated. After- 
wards, votes begin to be concentrated on more popular- 
horses, but the ranking of the winning horses does not 
change so much. AR does not change much and only 
EAR increases, and AR = EAR finally holds. 

With regard to the scale invariance, we explain the 
mechanism based on a simple voting model. The model 
shows exact scale invariance in the double scaling limit. 
In addition, the voting model in the limit is equivalent 
to a random ball removing problem. Using the equiva- 
lence, we show how to make an exact gradation pattern 
of mixed binary objects. 
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